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ij2 ' Abstract 

A central theme in social choice theory is that of impossibility theorems, such as Arrow's 
theorem IArr63j and the Gibbard-Satterthwaite theorem [Gib 73, Sat 75], which state that 
under certain natural constraints, social choice mechanisms are impossible to construct. In 
recent years, beginning in Kalai IKalOlj . much work has been done in finding robust versions 
^z • of these theorems, showing "approximate" impossibility remains even when most, but not 

all, of the constraints are satisfied. We study a spectrum of settings between the case where 
(-h ■ society chooses a single outcome (a-la-Gibbard-Satterthwaite) and the choice of a complete 

order (as in Arrow's theorem). We use algebraic techniques, specifically representation 
theory of the symmetric group, and also prove robust versions of the theorems that we 
state. Our relaxations of the constraints involve relaxing of a version of "independence of 
irrelevant alternatives" , rather than relaxing the demand of a transitive outcome, as is done 
in most other robustness results. 

> 

00 

en ' 1 Introduction 

en ' 



a 



Social choice deals with the aggregation of opinions of individuals in a society into a single opin- 
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£\j . ion. There are several important impossibility theorems in the field, stating that aggregation 

mechanisms satisfying some natural conditions, are dictatorial (dependent on the opinion of a 

single voter). 

• i-^ 

The first of these theorems is Arrow's theorem. Let there be a set of n individuals, who 
wish to decide on a ranking of m alternatives. Each individual has their own full ranking of the 
alternatives. Let L m be the set of full transitive linear order on [m] and O m be the set of all 
anti symmetric relations on [m]. A social welfare function (SWF) is a function / : L^ — > O m , 
that maps the individual rankings of the n voters into an aggregated relation. 

Definition 1.1: A SWF / is called 

• Independent of Irrelevant Alternatives (HA), if for every 2 alternatives a, b, the aggregated 
preference between a and b depends only on the individual preferences between a and b. 
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• Consistent, if it always returns a transitive order (is into L m ). 

Theorem 1.2: (Arrow) Form > 3, every function that is consistent and IIA, and agrees with 
unanimous votes, is dictatorial. 

Another theorem of similar flavor is Gibbard-Satterthwaite's theorem (GS), known to be 
strongly connected to Arrow's theorem. It deals with a setting in which the voters only wish 
to choose one of the m alternatives. A social choice function (SCF) is a function / : L 7 ^ — > [m], 
that maps the individual rankings of n voters into an aggregated choice. GS deals with the 
game-theoretic notion of strategy proofness, where no voter has an incentive to misreport their 
true opinion and obtain a better result from her perspective. 

For the formal definition of strategy-proofness, we introduce some notations. For a profile 
x S L^x = (xi,...,x n ) and a voter i £ [n], we will denote x = (x~ l ,Xi), where x~ % indicates 
the votes of all voters except the i'th. For y G L m , we will use < y to indicate the corresponding 
order. We similarly define > y ,> y ,<y 

Definition 1.3: A SCF / is called strategy-proof, iff 

Vi e [n],x- 1 £ V^ x , Xi ,y£ L m ,f{x~\xi) > Xi f{x~ l iy ) 

i.e. no voter, under any circumstance, has an incentive to misreport their true preference. 

Theorem 1.4: (Gibbard-Satterthwaite) for m > 3, a social aggregator f : LJ^ — > [m] that is 
onto and strategy-proof is dictatorial. 

The connection between the notions of strategy-proofness and IIA was demonstrated in 
[NP07J IDL07| . and the connection between the proofs of these theorems was demonstrated in, 
e.g., [RcnOl . However we are not aware of previous work which presents a single scheme that 
unites the different settings (SWF vs. SCF) and the different constraints (IIA vs. strategy- 
proofness) . 

In the past decade there has been a flurry of work done in providing analytical proofs 
of these theorems, and finding robust versions of them - i.e. showing that aggregators that 
almost satisfy the constraints (consistency, IIA, strategy-proofness) are close to fitting the 
classification (dictatorial). The relaxation of consistency in Arrow's theorem was initiated in 
[Kal01| and culminated in [Mosllj . which finally provided a robust version of the unmodified 
Arrow's theorem. See also |Mos091 iKellObl IKellOa] The same was done for several examples in 
the judgment aggregation setting in [NchlO]. See also [Xia08] . 

Relaxing the strategy-proofness constraint in GS has some important computational impli- 
cations. In [FK KN11] such a result was achieved for functions with m = 3, and in [IKM10] for 
neutral functions with m > 3. Recently [MRU] provided the final word on this theme, proving 
a robust version of the unmodified Gibbard-Satterthwaite theorem. 

Our work continues this line of research. A word or two on the novelty of our approach. The 
basic and beautiful idea in Kalai's paper |Kal01| . which was followed in most of the subsequent 



work, was using the IIA condition to translate a SWF to a set of Boolean functions, thus 
enabling the application of Fourier analysis on {0, l} n , a technique that is very useful for 
proving robustness results. The robustness referred to the distance between a given SWF to a 
function which is consistent, i.e. where the output is always transitive. In the current paper 
we chose to insist on the output being of the same form as the input (e.g. being a complete 
order in the SWF setting), and measure robustness with respect to the number of violations of 
a special set of constraints. 

To this end, and in order to allow a spectrum of results between SWF (Arrow) and SCF 
(Gibbard-Satterthwaite), we use a notion similar to IIA, called Independence of Rankings (IR) 
and show a robust impossibility theorem for it. IR first appeared in [DHlOb], where they 
characterize impossibility domains for non-binary judgment aggregation and IR is their prime 
example. 

We complete the analogue to Gibbard-Satterthwaite's theorem by showing that the impossi- 
bility theorem for the IR definition implies an impossibility theorem for some proper definition 
of strategy-proofness. This definition is an adaptation of a strategy-proofness definition intro- 
duced in |DL07j for binary judgment aggregation. 

Our approach leads naturally to representation theory of the symmetric group, which re- 
places the Abelian Fourier analysis that arises in the previous analytical works cited above. As 
in most applications of spectral techniques to combinatorial problems, this approach includes 
two components: The encoding of a combinatorial quantity as a quadratic form, and the ex- 
traction of combinatorial information from the spectral analysis of that form. In this work the 
algebraic encoding entailed the use of block matrices. The usage of block matrices encompasses 
substantial expressive power, as it enables the encoding of every Constraint Satisfaction Prob- 
lem for constraints that relate to pairs of input points, on any size of alphabet. The canonical 
uses of the spectral method, utilizing standard 0—1 matrices, are usually limited to a certain 
type of constraints on alphabet of size 2, which mainly enables the treatment of notions related 
to expansion in graphs. In this paper the spectral analysis involves tensor algebra, allowing us 
to take advantage of the block structure of the matrices. This enables the extraction of the 
combinatorial information encoded inside the block structure. 

2 Structure of the Paper 

The paper is organized as Follows: 

• In section [3] we define the constraints we are using and state the robust impossibility 
theorem. 

• In section |4] we provide a bird's-eye view of the proof. 

• In section [5] we recall some essentials of representation theory. 

• In section [6] we present the formal structure of the proof, divided into short lemmas, 
which are proved in the subsequent section. The section is divided into two subsections, 
subsection 16.11 which deals with the case of a single voter, and subsection 16.21 for an 
arbitrary number of voters. 



Section [7] provides the proofs of the lemmas from section [6] 

The final robustness result is derived from the conclusions made in section [6] combined 
with an extension of a result by Friedgut, Kalai and Naor ([FKN02J). Section [8] provides 
the proof of the extension of |FKN02| with the necessary adaptations to our setting. 
The main analytic tool this proof uses is a hypercontractive inequality of Beckner and of 
Bonami, which is also adapted to our setting in subsection 18.11 



In section [9l we introduce a strategy-proofness definition and show its connection to IR 
and an appropriate impossibility theorem. 



3 Results 

In this paper we present a robust impossibility theorem in the flavor of Arrow's and Gibbard- 
Satterthwaite's theorems. The constraint we will use is a variant of IIA. We present a single 
proof dealing with functions in a spectrum of ranges, from functions returning a full ranking 
(SWFs) to functions returning one alternative (SCFs), including a plethora of ranges in between. 
The result can be interpreted as a 2-query dictatorship test with full completeness. 

In our setting, we deal with aggregation of rankings of m alternatives. A ranking is a 
permutation x G S m . We will use the convention x(rank) = name. 

For presentation sake, we shall begin with the definition of the constraint when used for 
functions returning a full ranking (SWFs) and state the corresponding impossibility theorem 
without robustness. The more complicated definitions and theorems will follow. 

3.1 Main Theorem 

Definition 3.1: A social aggregator / : SJ^ —> S m satisfies Independence of Rankings (IR) iff 
the aggregated ranking of the j'th alternative is dependent only on the individual rankings of 
the j'th alternative 

Vx,y G S£ , j E [m], (Vi E [n},xr\j) = y 7\j)) =* f{x)-\j) = /(y) _1 (j) 

This constraint requires independence of rankings instead of independence of pairwise pref- 
erences required in IIA. This constraint was discussed in [DHl Obj. in the context of non-binary 
judgment aggregation. 

As in IIA, this definition compares voting profiles which may differ in any number of votes. 
Throughout this paper, we shall use an alternative, equivalent definition, that compares inputs 
that differ in a single vote (as is the case in the definition of strategy-proofness): 

Definition 3.2: A social aggregator / : S^ — y § m satisfies Independence of Rankings (IR) iff 
Vi G [n],j G [m],x -i G S*-\xi,ifc G S^x'^j) = y^ l (j) => f {x^ , Xi)" 1 {j) = f (x~* , yt)' 1 (j) 



It is easy to show that these two definitions are equivalent, via a hybrid argument. The 
corresponding impossibility theorem is 

Theorem 3.3: For m > 3, a social aggregator f : SJ^ — > S m that is IR is either a constant 
function or dictatorial of the following form: there exists a voter i and a constant permutation 
y of the rankings such that f{x) = y o X{. 

In |DH10bj . a similar impossibility theorem was shown, using purely combinatorial argu- 
ments. Their result deals with a larger range of possible formats of input and output, yet 
demands further constraints on the function in question. We do not see a way to extend their 
techniques to achieve robustness. 

3.2 Robust Impossibility Theorem 

A robust impossibility theorem means that when the constraint is almost satisfied, then the 
function is almost dictatorial. 

To the best of our knowledge, most previous robustness results regarding SWF's focused 
on relaxation of the rationality (the transitivity of the outcome), and measure the distance to 
a function which is rational. We, instead, demand rationality, and relax (our variant) of IIA. 
There is something satisfying about relaxing this specific constraint, rather than others, as it 
seems to be slightly less natural than rationality and unanimity. 

Definition 3.4: A social aggregator / : SJ^ —> §> m is called e — IR if the rate of constraints 
that are not satisfied is smaller than e, i.e. 

E P ^ 6 SSr\^ S S m li*7\j) = V7\j)) A (/(z-UrHi) ± f{x-\y t )-\j))] < e 

ie[n],je[m] 

Theorem 3.5: For m > 3, a social aggregator f : SJ^ — > S m that is e-IR is 0(m s e)) close to 
a function that is either a constant function or dictatorial of the following form: there exists a 
voter i and a constant permutation y of the rankings such that f(x) = y o Xj. 

3.3 A Spectrum of Ranges 

As stated earlier, we will also deal with a setting where the aggregated opinion is not a full 
ranking, but a partial ranking. Let H C § m be a subgroup of § m . We call it a fixing subgroup 
if it consists of all permutations respecting a given partition of the m rankings into 2 or more 
parts. An .ff-social aggregator is a function / : 8^ — > S m /H, where § m /F[ refers to right cosets 
of H (We use this notation even though H is not a normal subgroup). Many types of functions 
fall under this scheme. Examples are: 

• For H as the trivial group, ff-social aggregators are SWFs. 

• For H as the group of permutations fixing the element 1, .ff-social aggregators are SCFs. 



• For H as the group of permutations fixing the set {1,2,3}, //-social aggregators are 
functions returning triumvirates. 

• For H as the group of permutations fixing the sets {1} and {2,3}, //-social aggregators 
are functions returning a president and two vice-presidents. 

The definition of IR can be extended to //-social aggregators in the following manner: 

Definition 3.6: Let H C S m be a fixing subgroup of S m . For Hi a right coset of H in S m , 
and j £ [m] define the j-profile of H\ as the multiset H^ (j) = {y _1 (i)|y £ Hi}- 

Definition 3.7: Let H C § m be a fixing subgroup of S m . An //-social aggregator / satisfies 
Independence of Rankings (IR) iff the aggregated j-profile is dependent only on the individual 
rankings of the j'th alternative. 

Vt G [n],j e [m],x-' e S^\xi,yi G S^x'^j) = y- l (j) => /(s _i ,^) _1 (i) = fix'^Vi)' 1 ^ 

Example 3.8: Let H = § m) i, the group of permutations that fix the element 1. In that case, 
an H social aggregator / is a function that returns a single winner in an election. 

When the winner of the election is k, then the function returns the coset H k , which is the 
coset that includes all permutations that assign k to 1. The fc-profile of H is \H\ copies of 1. 

, I rxl 

The j-profile of H for every j ^ k has -j— j copies of each number between 2 and m. 

f is IR in that case if, for an alternative j, when given the individual rankings of j by all 
voters in a voting profile x, we are able to determine whether j is the winner of the election 
(i.e., the j-profile of f(x) is \H\ copies of 1) or not (i.e. the j-profile of f(x) has ^y copies of 
each number between 2 and m). 

U 

We shall leave the exact definition of an e-IR //-social aggregator to a later part of the paper, 
see definition 16.41 The impossibility theorems also extend to //-social aggregators. 

Theorem 3.9: Let H C S m be a fixing subgroup o/S m . For m > 3, an H -social aggregator f 
that is IR is either a constant function or dictatorial of the following form: there exists a voter 
i and a constant permutation y of the rankings such that f(x) = Hy o X{. 

Theorem 3.10: Let H C § m be a fixing subgroup ofS m . For m > 3, an H -social aggregator f 
that is e-IR is Oii(poly(m)e) close to a function that is either a constant function or dictatorial 
of the following form: there exists a voter i and a constant permutation y of the rankings such 
that f(x) = y o X{. 



4 Structure of the proof 

We give here a short exposition of the proof. For simplicity, we shall refer here to the basic form 
of the Main theorem (theorem 13. 3p . where the function is a SWF. To simplify the notation, we 
shall also use in this section definition \'d. II for IR, instead of 13.2^ which is the definition we shall 
use in the rest of the paper. 

We shall treat this problem as a constraint satisfaction problem (CSP). We shall use the 
definition 

Find all functions / : SJ^ — > § m s.t. 
IR: je [m],x,y £^, 

x-Hj) = y-\j) =* U(x))^{j) = (f(y)r\j) 

A CSP has a generic algebraic encoding. The function / can be encoded as a function returning 
a vector in IR m , which is the characteristic vector of the singleton {/(x)}. This encoding can 
be interpreted as a tensor F G IR^m* 8 ^ with 2 indices x,v G S r , 



y m 



■Tx,V i- V =f(x)- 

The constraints can be algebraically encoded using a matrix that represents their truth table, 
or, in our case, since we want to count the number of violated constraints, the truth table of 
their negation. We use a matrix of matrices. For every two inputs x, y G S^, the (x, y)'th entry 
of the matrix will be a matrix in ]R s ™ xS ™. This matrix will be the truth table of the negation 
of the constraints concerning x,y and j. This encoding can also be interpreted as a tensor: 







L ^ x y)v x v y ~ 1 [*- 1 (j)=j/- 1 (j)]A[«« 1 (j)?sr 1 a)] 



where j G [m],x,y G SJ^ and v x ,v y G S m . 

We can use these tensors in a quadratic form to count the number if unsatisfied constraints. 
Since L is the truth table of the negation of the constraints, the quadratic form 

^FL j F l 

j 

counts the number of violated constraints. 

The CSP under this encoding takes the form 

Find all FgR s ™ xS ™ s.t. 
Consistency: \/x G SJ^, Ff x * is a characteristic vector of a singleton 
IR: J2j FLJ F = G 

The proof unfolds as follows: 

• We show that L y (PSD), i.e. ^-FUF 1 > for every F. Therefore, the functions 
that satisfy IR are precisely the kernel of L. 



• Explicitly find the kernel of L, using diagonalization. 

• Show that all consistent functions in the kernel of L are dictatorships. 

As for the robustness, we will show that functions that are e-IR are L 2 close to the kernel of L. 
We shall generalize the result of [FKN02] to prove that such functions, that are also consistent, 
are L 2 close to dictatorships. 

For an H social aggregator, we shall encode / to return characteristic vectors of cosets of H, 
normalized so that their L\ norm equals 1. We shall call such vectors H coset vectors. As a 
tensor F, this encoding takes the form: 

F -J-1 

r X,V I rT | J-ogffi) 

\H \ J ' 

A very convenient feature of our definitions and approach is that the introduction of H social 
aggregators does not insert any new elements to the proof. Essentially, the same Laplacian L 
encodes the notions of IR and e-IR for H social aggregators, and only the consistency constraint 
changes. The algebraic CSP for H social aggregators is 
Find all F G M s - xSm s.t. 

Consistency: Vi G S^Fx* is an H-coset vector 

IR: J2i FLJpt = ° 



5 Representation Theory 

In this section we recall some basic notions of representation theory that are necessary for our 
proof. 

A representation is a Homomorphism p from a group G to GL^(C), the group of com- 
plex d-dimensional square matrices, d is called the dimension of the representation d(p). A 
representation is called irreducible if it is not similar to a direct sum of 2 representations. 

For a finite group, there is a one-to-one correspondence between the conjugacy classes of 
the group and irreducible representations (up to similarity). We shall denote the number of 
conjugacy classes of G as [G]. For a conjugacy class k G [[G]], its corresponding irreducible rep- 
resentation will be denoted as p . We will sometimes consider p as a function, and sometimes 
treat it as a vector with \G\ entries all of which are d{p k ) x d(p k ) matrices. 

In this paper we will deal exclusively with the symmetric group or direct products of the 
symmetric group. It is well known that in the symmetric group one can choose a basis for 
which all irreducible representations (also known as irreps) have real, unitary matrices as values. 
Henceforth we will assume we are dealing with such a basis. 

The defining representation of the symmetric group § m is the permutation representation 
P of dimension m. 

F\x)ij = i x ^ = j 



It is well known that P = p°®p l , i.e. it is the direct sum of two irreducible representations: the 
trivial one, which we denote by p°, and the (n — l)-dimensional p 1 . Specifically, if one chooses 
a basis for which p° , p 1 are real and unitary then there exists an orthonormal mx m matrix U 
such that 

P(x) = U(p°(x)(Bp 1 (x))U t (1) 

where here the © refers to a matrix composed of blocks. 

The all ones vector spans the one dimensional eigenspace of R m corresponding to the trivial 
representation component of P, hence U can be written in the following form (where C is a 
m x (m — 1) matrix, and the Cj's are its rows) 



U= : C = : : I (2) 




The character of a representation p, denoted by Xoi i s the trace of the representation: 
Xp{ x ) = tr(p(x)). It is easy to see the the character of similar representations p and UpU^ 1 
are the same. 



A final tool we wish to recall is Schur's orthogonality, which states that the vectors of the 
n pf- are orthogonal, (but not necessarily orthonormal) 

m! 



/ j P \ x )hjiP \ x )i2J2 ~ "kik2^hi2^. 



^d(p^) 

For a finite group G, these vectors form a complete orthogonal basis for the set of functions 
from G to C. 

Schur's orthogonality implies that the characters of irreducible representations are orthonor- 
mal: < Xp k iXp l > = $k,l- This implies that for a representation r and an irreducible representa- 
tion p, < Xti Xp > is t ne multiplicity of p in the decomposition of r to irreducible representations. 

A complete set of irreducible representations for G n is the set of tensors of the irreps of G: 

n 

{/ = (g)p r, }fG[[Gr 

5.1 Fourier transform and diagonalization 

For a finite group G, given a function / : G — > C, its Fourier transform at a representation p is 

f(p) = E xeG f(x)p(x). 
We shall sometimes use the abbreviated notation f(k) = f(p k ) 



For such a function /, define a matrix M G C GxG whose values are M X)V = f{x~ l y), then 
M can be partially diagonalized (decomposed into eigenspaces) using representation theory. In 
a partial diagonalization, an eigenspace does not have a corresponding eigenvalue, but rather a 
corresponding eigenblock. Given a chosen set of basis vectors {vi}i for an eigenspace of a matrix 
A, the (i, j)'th entry of the corresponding eigenblock is ViAiA. 

In the case of M, each irrep p defines an eigenspace spanned by the aforementioned or- 
thogonal vectors (pi,j(%)) X&G - The eigenblock corresponding to p k is I^ipk) <S> /(&)■ 

Another formulation of this is: M is a |G| x |G| matrix, whose entries are given by 



M x>y = £ d{p h )tr((^J(k)^) 

ke[[G]] V x ' y 



Where p k is a column vector of size \G\ whose x'th entry is the matrix p (x). The multiplication 
f{k)p k means multiplying the entries of p k by f(k), hence p k f(k)p k is a |G| dimensional 
matrix whose entries are d(p k ) dimensional matrices. 

For a function g, gM is known as the convolution of g and / and is denoted as g * f. The 
partial diagonalization discussed above shows that: 

gTf(k) = g(k)f(k) 

(gMg%, y = Y, d(/)fr ((?(fc)/(*)3*(fr) 

ke[[G]} V 



j-'-.y 



If / is a characteristic function of a set T of generators of G, then M is the adjacency matrix 
of the Cayley graph T(G,T). For convenience, we shall call such a graph a Cayley graph even 
when / is a characteristic function of any subset T of G, as the property of T generating G is 
irrelevant for our uses. 



6 The Proof 

In this section we present a more detailed version of the proof, divided into lemmas. The actual 
proofs of the lemmas will appear in section [71 

6.1 One Voter Functions 

We begin our analysis by treating the case of a single voter, since this contains the analytical 
seed from which the multi-voter case grows. The combinatorial problem for a single voter is not 
very difficult, although if one is interested in a robustness theorem it seems that straightforward 
elementary techniques are insufficient. 

In order to treat a social welfare functions on one voter / : S m — > S m , we construct in 
this section a quadratic form encoding the constraints, and, as mentioned in section SI we will 
diagonalize it. 

10 



For / : § m — > S m , denote by IR(f) the rate of unsatisfied constraints 

IR(f) = £ Pr X)y&m [{x-\j) = y- l (j)) A [f{x)-\j) + /(^(j))]- 

jS[m] 



Let X J be the matrix X £ 



pS m X§. 



m A >J>m 



VJ _ -1 -1 

Let X-? be its complement XJ xy = 1 — X xy . We will use the vector encoding described in(U 
F X)V = l v= fr x \ (or the corresponding definition for i?-social aggregators). 

The following lemma describes a quadratic form in the values of / that equals IR(f): 

Lemma 6.1: Let f be a social aggregator and F its encoding as described above. Let L' J = 
Xi XJ, and V = *£j L' j , then 

IR U) = W~l2 FL ' Ft 

X 3 is the adjacency matrix of the Cayley graph r(S m ,S m j), where S m j is the subgroup of 
S m of permutations fixing j. A quadratic form based on the Laplacian of that same graph is 
more suitable for our purposes, because it is PSD. The Laplacian of that graph, Y, is given by 

Y j = (m-l)U-X j 

The corresponding quadratic form is given in the following lemma. The quadratic forms given 
in lemmas 16.11 and 16.21 are equivalent when F represents a consistent function. 

Lemma 6.2: Let f be a social aggregator and F its encoding as described above. Let L" J = 
Y3 ®P, , and L" = £. L"* , then 

Since X the adjacency matrix of a Cayley graph, it can be partially diagonalized via the 
representations of the symmetric group, as explained in subsection l5.ll As will be shown in the 
proof of the following lemma, all the information relevant to the computation of IR(f) lies in 
X's p 1 component. This leads to a simplified quadratic form, used with a different encoding 
for /. Let g : S^ — > R( m ~ 1 ) x ( m_1 ) be a an encoding of / such that g(x) = p l (f(x)). A vector 
form of g is a vector G whose each entry is a m - 1 x m - 1 matrix G x = g(x). (For H social 
aggregators, g(x) = E yef ( x - ) p 1 {y)). 

The corresponding quadratic form is as follows. 

Lemma 6.3: Let f be a social aggregator and G its encoding as described above. Let L 3 = 
Y3 Di, where D3 = CjCj (See\jj), and L = £\ U , then 

IR W = W~T2 tr ( GLGt ) ( 3 ) 

11 



For an H social aggregator, IR(f) is denned as such: it averages, for pairs of inputs that 
agree on the ranking of j, the square of the £2 distance of the characteristic vector of the j-profile 
of the outputs. 

Definition 6.4: For an //-social aggregator / : SJ^ — > S m /H, define IR{f) to be 



1 -j 



/(as-Sasi)- 1 ^) ~ ^(ar-'.w)- 1 ^) 



\H\ 



2, 



Where for a multiset S, ng is its characteristic vector, i.e., for an element x, ng at the index x 
equals the number of occurrences of x in S. 

In the following we claim that the quadratic form we have defined in 16.31 indeed equals the 
value IR(f), as defined in definition 16.41 As usual, we only address the 1 voter case in this 
subsection: 

Claim 6.5: Let H C § m be a fixing subgroup of S m . Let / be an //-social aggregator over 1 
voter, and G be its encoding as defined above, then 

IR U) = Wl2 tr ( GLGt ) 

We partially diagonalize L in the following lemma, decomposing it to eigenspaces. For the 
purposes of this lemma, we shall define the operator tr that operates on block matrices. The 
operator returns a matrix whose (x,y)'th entry is the trace of (x,y)'th block in the original 
matrix: tr(M) xy = tr(M xy ). 



Lemma 6.6: 



L= Y, d(p r )tr((p r ®I)L(r)(p r ®I) 1 



re 



where 



2(0) = 1-0 , L(r > 1) =1® /• — 

m 

1(1) = — *— ( ^— -I®I-'yDi®Di 
ra-1 m *-? 



The diagonalization of L(l) is given by: 



Lemma 6.7: L(l) has 3 orthogonal eigenspaces whose dimensions are l,m — 1, (m — l) 2 — m. 
Denote their corresponding basis matrices as Uq,Ui,U2- The corresponding eigenvalues are 
0, m ( m _i) ; ^- The eigenvectors are vectors in H( m ~ l)-(m-i) _ When read as a (m — 1) X (m — 1) 
matrix, Uq is the identity matrix. 
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This diagonalization proves that L is PSD and hence, in light of Lemma 16.31 determines 
that its kernel is the space of IR functions. This is summarized in this corollary: 

Corollary 6.8: Let f : S m ->• S m /H be an IR function, and let g : S m -)■ K™- lxm4 be 
defined by g(x) = p 1 (f(x)) then there exist vectors a and b in R^" 1-1 )^" 1-1 ) ; that can be read as 
(m — 1) x (to — 1) matrices A and B) such that 

g x = bp°(x) + tr {{aU£)(I p\x)) =B + Ap\x) 

To complete the characterization of 1-voter IR functions, we present the following claim and 
lemma. We shall not prove the lemma as it is a consequence of the n voter case. 

Claim 6.9: Let if be a subgroup of S m , and let / be an if-social aggregator and g be g(x) = 

P X (f( x )), tnen 

\/x,g{x)g t {x) = M 

When H is a fixing subgroup, then M/0. 

Lemma 6.10: Let g : S m — > i^ m - lxm - 1 ft e a function of the form g(x) = B + Ap 1 (x) that 
satisfies the constraint Vx,g(x)g (x) = M for some constant matrix M/0. Then either B = 
or A = 0. 



6.2 Many Voter Functions 

The quadratic form capturing the notion of IR(f) for functions on n voters is constructed using 
the quadratic form for 1 voter, in the following lemma. 

Lemma 6.11: Let H be a fixing subgroup o/S m . For a function f : SJ^ — > § m /H, let G be as 
before, the encoding of f in terms of its p 1 component: 

G(x) = p 1 (f(x))=E yef{x) p 1 (y), 

where the expectation with respect to y refers to the case of H -social-aggregators. Let 

L n,j,i = j®i-l ^ yj q j®n-i q D j ^ L n,i = ^2 L n,j,i jn = ^ jn^ 

3 i 

Then the number of unsatisfied constraints is 

■ ' " - n /-it \ 



IR(f) = 7^+iHGL n G 
We can diagonalize L n based on our diagonalization of L: 
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Corollary 6.12: The diagonalization of L n is given by: 

£ n ' l = ^r4n E d(jf)tr((jf®I)i£i(r)(j> ,! ®iy 



1 ' re[[S m ]]» 
The L n ' 1 's are derived from the 1 voter L 's, as follows. 

t&[n],t=/=i 

The L n coefficients are matrices which are not necessarily diagonal. The following lemma 
partly characterizes their diagonalization, in a manner that suffices for our needs. 

Lemma 6.13: 

1. If there exists any coordinate i £ [n] for which n > 1 then L n (f) y —I (A y B means 
that A- B is PSD). 

2. Otherwise, if there exist at least 2 coordinates i £ [n] for which fj = 1 then L n (f) y 

3. Otherwise, if there exists exactly 1 coordinate i £ [n] for which r% = 1 then L n (f) = L{\). 
As shown in lemma \U77\ it has a eigenvalue corresponding to the eigenvector Uq and a 
smallest nonzero eigenvalue of O (i)- 

4- Otherwise, f is all zeros, and L n (f) = 0. 

For a PSD matrix, its spectral gap is its smallest non-zero eigenvalue. We conclude the 
following from the diagonalization of L n : 

Corollary 6.14: 

• The kernel of L n , which is the set of all IR functions, consists exclusively of functions of 
the form g(xi,...,x n ) = B + Ya=\ A 1 -p 1 ^) 



The spectral gap of T^—\L n is ^A 



rri 1 ) ' 

To finish the proof of theorem 13.31 we need to show that the intersection of the kernel 
of L n with the consistency constraint, includes only dictatorships. We don't need to use the 
consistency constraint to its full capacity. All we need to use is the quadratic constraint that 
\/x,g{x)g t {x) = M, where M is some constant matrix. In the proofs section, we shall show that 
this constraint is valid for any H. For instance, if / is a SWF (H is the trivial group), then 
since p 1 is unitary, Mx,g{x)g t {x) = I. 

Corollary 6.15: IR functions which are consistent are dictatorships. 
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6.3 Robustness 

By using the quadratic form, we are able to connect the combinatorial notion of IR(f) to the 
analytical notion of the distance between / and the kernel of L n . This connection depends on 
the spectral gap of L: 

Corollary 6.16: IfIR(f) < e, and g is the encoding of f as above, then there exists a function 
h in the kernel of L n such that \\h — g\\\ = E^eS™ \\h{x) — g{x)\\\ < 0(m 2 )e 

In corollary 16.151 we characterized IR functions, using the fact that the function satisfies 
two constraints: 

• Being IR, which a linear constraint, because it is equivalent to being in the kernel of L. 

• Being consistent, which a quadratic constraint. 

It is clear that the intersection of a linear and a quadratic constraint can contain only a few 
points. Indeed, we showed that consistent functions which satisfy the linear constraint are 
dictatorial. 

For e-IR functions, corollary 16.161 shows that the first (linear) constraint is relaxed to being 
Li close to the linear constraint. We wish to show that consistent functions that are Li close 
to the linear constraint are Li close to being dictatorial. 

In [FKN02] a similar result was shown for Boolean functions on Boolean variables. It was 
shown that Boolean functions that are linear are dictatorial, and that Boolean functions that 
are Li close to being linear are Li close to a dictatorial function. Being Boolean is, naturally, 
a quadratic constraint. We adapt this theorem to our setting. From it we deduce our main 
theorem. 



7 Proofs for section [6] 

7.1 Proofs of the Lemmas for 1 voter functions 

Proof of lemma I6.lt This is the straightforward definition of the anti-constraints. 

io i2 Z_> Fxv x L XVxyVy F yVy = 2_^ Fxv x X xy X3 VxVy F yVy = 

£»m ■ \^>m\ 

jxv x yv y jxv x yv y 

1 ^ 

jxv x yv y 



E^ 1 



x- 1 (j)=y- 1 (j) 1 f(x)-^iJW(y)- 1 U) 



□ 
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Proof of Lemma 16.2b 

Recall 

L" = Ej xj ® (J-Xi) = £\ X^®J - Xi®X^ 

L' = X)j ((m-l)!I-XJ) ® X»' = Ej (m-l)!J®^ - P®P 

Therefore, we need to show that 

F ( Z xj • J ) F * = F ( Z( m ~ w ® xj ) F * ( 4 ) 

when F is consistent. 

Since F is consistent, Vx, ^2 V F xv = 1, so the left hand side of (HJ is 

Y F XVx (Xl y • 1) F yVy =Y J Xly = rn-m\.(m-l)\= ml 2 

jxyv x v y jxy 

Since the diagonal of X 3 , for every j, is all ones, the right hand side of @ is 

Y F XVx ((m - l)\5 xy ® Xi xVy ) F yVy = Y, F ™ x {(m - 1)!^) F XWy = 

jxyv x v y jxv x v y 

Y ((m - l)!XJ (x)/ , } ) = £ ((m - 1)!1) = m • m! • (m - 1)! = m! 2 



jx jx 



□ 



Before we carry on, this is good place to recall U and C from equation^ The orthonormality 
of U implies the following: 



Claim 7.1: 






CC l = 


I - 


J 

m 


c l c = 


I 




\c = 








Proof of Lemma 16.3b We will need two simple claims and their corollary. 

Claim: 

Xi u = {U{p°(xy- 1 )®p 1 (xy- 1 ))U t ) jj 

Proof: Recall P, the defining representation of S m . A permutation x has a fixed point j iff 
(P x )jj is 1. Therefore, 

-^■xy \^x- 1 y)jj 

Recall that P x = U(p° © p l )U l , and that U is orthonormal. Therefore 
Xl y = (Pxy-i)n = (U {P^xy- 1 ) © pHxy- 1 )) U l ) .. 
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□ 

Claim: 

xi y =( lA Z 1)lt + c P 1 ( X y- 1 )c t ) (5) 

\ m J jj 

Proof: Follows from the expansion of U. 

n 

Next denote X* = X^° + X^ 1 , where X 3 x f = ( ±£&£^ ) and xh] = (Cp 1 (xy- 1 )C t ) ... 

\ / jj 33 

Likewise, denote V = U'° + Lj, 1 where V'° = Y^ (g) X^° and V' 1 = Y* ® a?* 1 . 

Corollary: 

IR(f) = FL'F 1 = ^ {FL^F* + FLP'^F*) = ^ FlJ' 1 F t 

j J 

Proof: The first equalities follow from the expansion of IR(f) and L' . We now show that 
FlJ^F 1 = 0. 

xyv x v y \ \ / j'i/ 

Since F is consistent, and p° = 1^, we have ^„ F XVx p°(v x ) = l x . Therefore 



,ry 



D 



It follows that 



j jxyv x v y m| 

In light of the above we can define G, a vector whose entries are (m — 1) dimensional matrices, 
as G x = ^2 V F xv p l (v), and the quadratic form becomes 

IR W= E T^( G ^kiA% C ^ C 3iy){G y ) Uv =Y,^r{G{Y^W)G t ) 

, , , \>->m\ ■ \>->in\ 



This completes the Proof of Lemma 16.31 

□ 
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Proof of claim 16. 5t Recall definition [67 



n f(x)-Hj)- n f(y)- 1 U) 



\H\ 



We will show the equivalency via two simple claims: 



Claim a: 



Claim b: 



IS 



^tr(GLG') = ^E x , y (l x - 1U)=y - 1{j) \\C^\x) - C^g\y)\\l) 



n f(x)-Hj) ~ "/(s/)- 1 ^) 



||cV(x)-cV(y)lll 



\H\ 



Proof of Claim a: Quadratic forms based on matrices which are Laplacians of graphs, such as 
y J , naturally differentiate values across edges of the graph. This can be seen via the following 
simple decomposition of Y J : Define the matrix Z J ' x,y to be the Laplacian of the graph whose 
vertices are the elements of S m and is either the empty graph or contains a single edge connecting 
x with y in case x _1 (j) = y~ l (j). 

Clearly, Y 3 = ]T\ ws m \ Z 3 > x > y . It is also easy to see that the diagonalization of the Z's 

is given by Z^' x ' y = l xy -i(j\ = jd x > yt d x,y where d x ' y is a vector that has 1 in x, — 1 in y and 
otherwise. 

Using this, we get 



tr (GUG 1 ) = Yl tr ( G ( ZJ ' X ' V ® DJ ) Gt ) 



£ l X y-i(j)=jtr (G (f'* &*) ■ (rf™ ® &) G 

J2 Vi(»>i fr ((9(x) - g{y)) C* ■ C* (g(x) - g(y)) 
(*,2/)e( s ™) 

£ l^iO)=, (C j (g(x) - giy)) 1 , C* (g(x) - 5 (y)) f > 
(^,y)e( s r) 



□ 



Proof of claim b: Clearly, the normalized characteristic vector of the j-profile of g(x) is 
TjjrTLf/ x )-i(j) = Cj (E,y£g( x )Py} , where ej is the j'th unit vector. When transforming this vector 
using the orthonormal matrix U, we get: 

-Ln fix) _ 1{j) U = (ejEy^pt) U = (e d U (l © g\x)) U l ) U = e 3 U (l © g\x)) = 
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Ill 



Cj ){l®g t {x)) 



in 



C j g\x) 



Therefore, since U is orthonormal, 



n 



f{x)-m)- ll f{y)-Hj) 



\H\ 



n 



fi^-Hj) ~ ll f(y)-Hj) 



\H\ 



U 



m 



c j 9 f (x: 



m 



C J g\y) 



HcV(x)-cV 



□ 



Proof of lemma 16.6b Recall that 

L = ^2(( m - 1)11 - X j ) D j 



The identity matrix can be decomposed using the irreps of § m and Schur orthogonality. 
<»' - A)!/, (/ = ^=^ ( £ d(p r )p r (x)p r (y~ 1 ) 



ml 



,re\\s„ 



Recall the decomposition of X 3 from ([5]): 



xL = p°^p°(y- 1 ) +CjAx)pHy -i n 



■zy 



in 



Summing these decompositions, and using the fact that YLjD 3 = ^jC^Cj = C l C = I 
yields that for a fixed pair x,y, one has 



I 



■>-u 



p°(x) <S)I 

(m — l)tr( p 1 (x) (8 I 

Ef>7 ]l d(p k )tr( p k (x)®I 

p°(x) <g)I 

(m — l)tr( p 1 {x) (8 I 

ELm" d(p k )tr( p k {x)®I 



as required. 



1 / m—l 



m—l \ m 



m m 



'-i^iE.cp, 



1 / m— 1 



m—l 




I ® J - £j Di ® IP 



^(y- 1 : 


(8)1 


+ 


^(y- 1 ; 


(8)1 


)+ 


p^y- 1 


(8>I 


) 


^(y- 1 ; 


<8>I 


+ 


^(y- 1 ; 


CS>I 


)+ 


p fc (y _1 


(8/ 


) 



□ 
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Proof of lemma 16. 71 Denote by E, a in x (m — l) 2 matrix whose j's row is Cj (8 Cj. We 
need to diagonalize 

Y^ D j ® I>» = J] (CjCj) ® (CjC,-) = ^ (C* ® Cj) (Cj ® Cj) = E l E 



The nonzero eigenvalues of E t E are the nonzero eigenvalues of EE (this can be deduced from 
the SVD decomposition of E). Recall CC l = I — —. Therefore, 



(EE 1 ) = (d ® d) (Cj <8 Cj) = (CC*) 2 = (CC*) 



<5ii J = (l )Sij+ ( — 



Therefore, EE 1 = m -^ L I + -^?, and its eigenvalues are ^-^ with multiplicity 1 and m -^ 

lit Til J ft. lit 

multiplicity m — 1. 



We can verify that the eigenvector of E t E corresponding to the ^— eigenvalue is Uq, which 
is the identity matrix parsed as a vector. We need to use two simple facts: 

• For 3 matrices A, B and C, the term A- B ■ C l when parsed as a vector, is equal to the 
term (A (8 C) ■ B, when B is parsed as a vector. 

• For a matrix C whose rows are {Cj}j , the term V • Cj (8 Cj equals to the term C t C parsed 
as a row vector. This is because C t C = ^ ■ C-Cj. 

Therefore, we get that 

J \ m — 1 



(E ■ U )j = (Cj Cj)U = CjIC) = CjC] = (C7C% = (l - ~) 



# m 



Which means that E'C/n = l 122 -^- and 

u m 

(i^) Db = EH ■ ^-1 = ^-1 I Vc 3 ® Q 

Since ^ ■ Cj (8) Cj is C t C parsed as a vector and C l C = I and Uq is I parsed as a vector, we 
get that (&E) U = ^U . 

n 

Proof of corollary 16.8b We have shown that eigenspaces of L with eigenvalue are p° and 
p 1 ^/ . All the other eigenvalues are positive, so L is PSD, and IR functions are in the kernel of 
L. 

Therefore, if G is an IR function: 
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1. Its p° Fourier coefficient can be anything. 

2. Its p 1 Fourier coefficient must be of the form A (U ^ for some vector A. Recall that this 
Fourier coefficient has 4 indices, and that U° is the identity matrix, parsed as a vector, 
and also A is some matrix parsed as a vector. 

3. All of its other Fourier coefficients must be 

Explicitly, 

g(x)ki = B k[ p°(x) + s ^p] s {x)5 s iA kt = B ki + ) j A kt Pu(x) 
ts t 

This means that the function g is g{x) = B + A ■ p 1 (x) (where A and B are parsed as (m — 
1) x (m — 1) matrices), so g is a linear function in p 1 (x), and, according to lemma 16.101 since 
g is consistent, either A or B are 0. 

□ 

Proof of claim l6T9t Let Mh be Mg = K X £np 1 (x). Since g is consistent, it must be of the 
form \/x,3y,g(x) = Mnp 1 (y)- Therefore, 

g(x)g(x) t = M H p 1 (y)p 1 (y) t M t H = M H M H = E xeH p 1 (x)E yeH p 1 (y- 1 ) = 

E x ^ HP 1 {xy- 1 ) = E zeH p l {z) = M H 



We now turn to show that Mh ^ when H is fixing. Denote Pjj = E xg #.P(x). Clearly 
Ph = U(l © Mh)U 1 . Therefore, if Mh = 0, then Ph — J ■ However, let i and j be 2 indices 
from 2 different parts of the partition that H fixes, then clearly {Ph)u 7^ (Pit)ij, because for 
every x G H, Pij(x) = 0, and for some x € H , Pu(x) = 1. 

□ 

Remark 7.2: Notice that when H is not a fixing subgroup, it might be the case that Mh = 0, 
and then g = 0, and every consistent H social aggregator satisfies IR. This is the case, for 
instance, when H is the group of even permutations. 

□ 



7.2 Proofs of Lemmas for many voter functions 

Proof of lemma l6.11t We show that applying the n voter quadratic form is similar to applying 
the 1 voter quadratic form but over pairs of inputs that differ in only one voter: 

GL n* G i = ^ G(x) (I®*- 1 © Yi © I® n - 1 © IP) G\y) = 
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(i-1 n \ 

Y, G(x-\xi, yt) (y J 8) D>) G\x-\ Vi ) 
From here, the proof follows exactly in the path of the proof of claim 16.51 



□ 



Proof of lemma I6.13E 

1. Let i be such that t% > 1, then, by corollary 16.121 

^(f) = -^(g)/ d ( rk )®im-l 
fe=l 

Since L n (f) = ^ • L n >i(f) and for every j,f, L n 'J(f) y 0, we have 

2. We shall focus only on the case where there are exactly 2 distinct i and j such that 
7*j = Tj = 1, as this case produces the minimal eigenvalue. Indeed, assume that there are 
k > 2 such indices i\,i2, —ik f° r which n. = 1, and denote r' to be 1 in %\ and 12 and 

otherwise, then L n > H (r) + L n ' l2 (r) = L n (r') ® I m ~\- Since the rest of the terms L n,Jfe (r) 
are PSD, The minimal eigenvalue of L n is at least as large as the minimal eigenvalue of 
L n (r'). 

Recall the diagonalization of L(l) (from lemma loT7|) 



1 ' - - - ' —OrrO* 



L(l) t — r- /»/ rC/ U C/ 

m(m — 1) \ m — 1 

From this, it is easy to deduce the following: 

Ln(r) = L^(f) + L^ttf) y — -i [21® I® I — (^M** + A^')] (6) 

m(m — 1) \ m-1 \ // 

Where A 1 is a (m — l) 3 x (m — 1) matrix of the form -A( pgs )t = S pq 5 s t (p, q, s and t are 
indices going from 1 to m — 1. (pgs) forms the row index of A 1 and t is its column index). 
Likewise, A> is a (m — l) 3 x (m — 1) matrix of the form Ai pqs \ t = S ps 5 q t. Denote the right 
hand side of ([6]) as Q. 

Denote B = A i + A? and C = A* - A*. It is easy to see that i ! i' ( + A' A? 1 = 
2 [BB t + CC 4 ) . The following are easy to verify: 

• B l C = C l B = 
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• BB l = 2m-I 

• CC f = (2m - 4) • / 

Therefore, we may deduce that the columns of B and the columns of C are orthogonal 
eigenvectors of A 1 A 1 + A 3 A 3 with eigenvalues m and m — 2, respectively. Plugging this 
into the expression for Q, we get that the minimal eigenvalue of Q, corresponding to the 
columns of B, is , x lS (2 - -=M = ," 1 ~ 2 1 , 2 . 

' m(m— 1) v m— 1 / m(m— l) z 

Important Note: Notice that if m = 2, Q has eigenvalues equal to 0, and therefore we 
cannot deduce that the function is a dictatorship for m = 2. 

Items 3 and 4 are trivial. 

Proof of corollary I6.15t We need to show that only one of B, A\, ..., A n is not zero. We 
begin by showing that w.l.o.g., we may assume that E x g(x) = and therefore B = 0. Indeed, 
we introduce a dummy variable y £ S m and define 

g'(x,y) = g{xy' 1 )p 1 {y) 

Where for x S S^ we denote xy = (x\y,X2y, ...,x n y). Note that E, x>y g'(x,y) = because 

^x, y g'(x,y) = E x>y g'(xy,y) = E x>y g(x)p 1 (y) = E x g(x)E y p 1 (y) = 

and that IR(g') = IR(g). Assume the claim is true for g such that Eg = 0, apply it to g' to 
get that either g'(x,y) = A i p 1 {xi) for some i, or g'(x,y) = Bp 1 (y). In the first case, it follows 
that g(x) = Aip 1 (xi), and in the second case, it follows that g{x) = B. 

We now assume that B = 0. Recall that by claim 16.91 we have g{x)g{x) t = M for some 
M/0. On the other hand, 

g^gixf =Y t A i P \x i )p 1 \xj)A^ 

The summand for i,j, translates to 

p,q,w,s 

From this expansion we may deduce the Fourier coefficient of gg l at f where f is 1 at i and 
j and otherwise: 



99 t tpu,u qs if) = (m - l) 2 ( (AipAlJgs) + (AlpAlJqs) j (7) 

Since gg l is a constant function and the Fourier expansion is unique, we get that gg t (r) 
must be 0. Assume by contradiction that there exist indices tp and wu where A\ p ^ 0, A 3 WU ^ 0. 

Equation (J7J) implies that A 3 tp A l wu = —Al p A J wu / 0. Therefore, A J tp / and gg t t P tpqqf) — 

At p A tp 7^ 
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□ 

Proof of corollary I6.16t 

The result follows from the following general statement regarding quadratic forms. 
Claim: Let K be a finite set. Equip the linear space M with the L2 metric d(u, v) = 
^xek(v x — u x ) 2 . Let M G M. KxK be a PSD matrix with spectral gap A, then for any vector u, 

T—ruMu* < Xd(u,ker(M)) 

\K\ 

where ker(M) is the kernel of M and d(u, ker(M)) is the minimal distance of u from an element 
of ker{M) according to d. 

Proof: Assume that M has eigenvalues = Ai < A2 < ••• < \kU with corresponding eigen- 
vectors vi, ...v\ku an d that s is the first index for which X s > 0. Denote by Ui the projection 
of u on V{. Clearly, the distance of u from the kernel of M is t4m Y^ s uf, and the value of the 
quadratic form on u is 



u 



\K\ \K\ \K\ 

l Mu = Y^ ujXi = Y u}\i < A s ^ f 2 



D 



In our case, we have shown in lemma 16.111 that IR(f) = i s , n+1 tr(GLG t ). When the 
elements of G are parsed as vectors instead of (m — 1) x (m — 1) matrices, this translates to 

IR(f) = w ^ TT G-(I®L)G t 

Since the spectral gap of w— rL is O (^1) , so is the spectral gap of Tg—\I ® L. 

Therefore, HIR(f) < e, then there exists h in the kernel of I®L such that \\G— h\\2 < 0(m 2 )e 
(when the elements of G and h are parsed as vectors) . 

□ 



8 Adapted version of [FKN02] 



For the sake of self-containedness, we present here the proof from FKN with minor modifications, 
needed for the application of the theorem to our setting. 

The adapted theorem goes as follows 

Theorem 8.1: Let Lin(S^) be the space of functions of the form ^2iA t p 1 (xi). Let g : SJ^ — > 
gm-lxm-l £ e a j unc n on SUC } 1 t^t 
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• Eg = 

• There exists a matrix M such that tr(M) = 1 and Vx S §J^, g{x)g t (x) = M . 
. \\g-Lin{^)\\l<r. 

then there exists i such that E||g — A 1 /? 1 (x^) H^ < 0(m 5 T). 

Before we carry on with the proof of theorem 18. 1\ we shall show how to apply it to get the 
proof of the main theorem: 

Proof of theorem 13. lOt Let / be an e-IR H social aggregator and let g be g(x) = p 1 (f(x)), 
as before. We assume, w.l.o.g., that Kg = (see the proof of I6.15p . Since g is consistent, by 
claim [6791 it holds that there exists some matrix M such that for every x, g(x)g t (x) = M. Let 
K = tr(M). By corollary 16.141 we have K\\g — Lin(§m)\\2 — 0(m 2 )e. Therefore, we may apply 
theorem 18.11 on -j^g and get that there exists i such that E||g(x) — ^p 1 ^)!!! < OiKmJe). 

This is almost what we need, except that we have no guarantee that A l p 1 (xi) is consistent. 
However, we show in lemma [8721 that we can round it to a consistent function without losing 
more than a multiplicative constant factor in the distance. 

The value of K ranges between 1 (for SCF's) and m (for SWF's). 

□ 

Next we show, as promised, that we can "round off' our approximation to an approximation 
which is consistent without losing too much. 

Lemma 8.2: Let g : §J^ — > R( m_1 ) x ( m_1 ) be a function such that there exists a matrix M such 
that range(g) C M ■ range(p 1 ) . Assume that there exists a function h : S^ — > ]R( m_1 ) x ( m_1 ) ; 
where h(x) = Ap 1 (xi) for some i, and \\g — h\\2 < 6, then there exists a function h! = A' p 1 (xi) 
such that range(h') C M ■ range(p 1 ) , and \\g — h'\\2 < 25 

Proof: Assume that there does not exist A' € M • range{p l ) such that \A — A'\2 < 5, then 

\g - h\ 2 = \gp\x- 1 ) - hp 1 ^- 1 )^ = \gp\xr 1 ) - A\ 2 > d{M ■ rangeip 1 )^) > 5 

which is a contradiction. Therefore, there exists such A' . Define h! as above h'(x) = A'p 1 (xi), 
then 

\g-ti\ 2 < \g-h\ 2 + \h-ti\ 2 <25 



□ 



Proof of theorem I8.lt 

Let h be the projection of g on Lm(S^) and let 

q = g-h 
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and 

r = f-f-M 

Since Eg = 0, there exist A l, s such that h = ^2 i A l p 1 (xi). We will show that since h ■ h l is close 
to M, r is typically close to 0, and we will deduce some information on the A l, s. 

In the proof of corollary 16.151 we used the fact that when e = 0, r = 0. For the case when 
e is positive, we will try and show that r is close to the function. 

Remark 8.3: Note that the entries of r are of the form 



r k ,i{x) = \C + Y J A i p 1 {x i )p lt {x :j )A t :j 



i- j 



k,i 



Which makes functions of degree 2. For a formal definition of the degree of a function, see 
definition 18.111 to come in subsection 18.11 

□ 

Remark 8.4: In the following lemma, we use a constant C = — 5 that is defined in corollary 
18.131 We shall call it C from now on so that it will be easier to trace its origin. 

□ 

Lemma 8.5: 

E(r 2 ) <Ke = 108(m - l) 4 C 8 e. 

Corollary 8.6: There exists i such that [|A*||| > 1 - (l + ^ m [_}} K ) e = 1 - 0(m 3 )e. 

Clearly, this corollary implies theorem 18. 1[ 
Proof of corollary: 

Because of the orthogonality of h and q, and because E||<?||2 = 1, E||/i||2 = 1 — e. 
Also, because of the orthogonality of the p 1 (xj)'s, 



E||r||i > X>PV(^(^V (*;))'lli 
Expanding this expression we get: 

spy (x<)(AV to))*!!! = 
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E ziZ2 2-^ ^a/3iP/3i7 1 ( x 0/ 5 77i7i( x i)^r ? i^L/3 2 / 9 /3272( Xi )^»7272V X i)^W2 

a/48171 7/1/8272 ?72 

/ j ^-al3i"Pi^2 "71 72 "»71 m "71 72 ^£tr/i ^afe A" 



ap,Pi'y 1 rnP 2 l2V2 



m _ i Z^ A aP A U Al al3 A U 
afj,l3r) 

1 U'llSllA''! 12 



-, n 2P 2 

m — 1 

Therefore, if max ||A||| = i 

(E||/ i i) 2 = (i- e ) 2 = E(i^ii2) 2 + £ M'iiiii^ill< 

(1 - e)t + (m - l)E||r||l < (1 - e)i + (m - 1)^6 
which gives the desired bound on i. 

□ 

Proof of Lemma 18.51 : The proof consists of two parts: first we will show that typically 
r obtains values close to 0. Then we will use a hypercontractive estimate due to Beckner and 
Bonami to bound higher moments of r in terms of its second moment showing that its tail 
decays fast enough. 

Lemma 8.7: Let < a < 1/4 be a constant to be chosen later. Let 

p = Prob(\r\ 2 > a ). 

Then 

16(m-l) 2 e 

P< 2 ■ 

or 

We defer the proof of this lemma for the moment. 
Lemma 8.8: 



E(\\r\\l)< 



(m — l) 2 a 2 



l-4(m-l)C 4 Ve/a' 



Choosing the optimal value of a (which is 6(m — 1)C ye) immediately proves Lemma 18.51 
So, to finish the proof we now present the proofs of Lemmas 18.71 and 18.81 

Proof of Lemma 18. 7\ Recall that q = g — h and that E||g||| = e. Using the fact that gg l = M 
yields 

r = hh l - M = qq l - gq l - qg l 
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By the triangle inequality, 

||r(x)|| 2 < ||r(a:)||i < \\q(x)q t (x)^ + 2\\g(x)q t {x)\\ 1 

We shall prove in claim [8U1 that, for d x d matrices A and B, ||AB||i < c?|| ^4. 1 1 2 1| -^^ II 2 - This 
implies 

||r(x)|| 2 < (m- l)(||g(x)||| +%(x)|| 2 ||g(x)|| 2 ) 

denote t = ||g(x)|| 2 . We use that fact that ||g(x)|| 2 = tr (g(x)g t (x)) = tr{M) = 1 

||r(x)|| 2 < (m-l)(2t + t 2 ) 

A simple analysis shows that if t < a/4(m— 1), then ||r(x)|| 2 < a. Hence by Markov's inequality 



Pr[||r(jc)|| 2 > a] < Pr[\\q(x)\\ z 2 > (a/4(m - I)) 2 ] < 



(4(m-l)) 2 e 
I2. 






Claim 8.9: The following is the version of Cauchy- Schwartz that we used in the previous proof. 
Let A and B be two d dimensional real matrices, then 

||AB||i <d||A|| 2 ||B|| 2 

Proof: Let J be the all ones matrix. It is easy to see that for a matrix A, \\A\\i =< A, J >. J 
can be decomposed to a sum of d permutation matrices P 1 , ...P d . Then, 

d d d 

\\AB\\ X =< AB,J>= ^ < AB,P { >=Yj< A i Btpi >< Yj Plbll^ih = dp|| 2 ||£|| 2 

i=l i=l j=l 

When A = B = J, this inequality is tight. 

□ 

Proof of Lemma 18.8b For convenience of notation let X = i?||r|| 2 , Xij = E(rfA and Yij = 
E(rfj). Also denote pij = P(|fy| < a). In corollary 18.131 we show a Beckner type inequality of 
the form: 

Yij < C X {j 

(Notice that, as mentioned in remark 18.31 the entries of r are real functions of degree 2 with 
Fourier coefficients only in p° and p , and therefore conform to the requirements of corollary 
I8.13P Using this we obtain 



X = J^Xij = X>H) = E {^-Pv)E{r 2 l3 \r 2 l3 < a 2 ) + PlJ E{r 2 3 \r 2 3 > a 2 )) < 
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£ ((1 - Pij )a 2 + Pij yjE{rt j \rt J >a*)) < 



This yields 



SI* WSJ* 

£ (a 2 + v^a 4 ^) < 

^ (a 2 + VPC 4 ^) = 

(m - 1) V + ^/pC A X < 

(m - 1) V + 4(m - 1)^C 4 X 

a 



x< (m-l) 2 a 2 



l-4(m-l)C 4 Ve/a' 



a 



8.1 Beckner's inequality 

This subsection is an adaptation of results from [Wol07j for our case. 

8.1.1 Introduction to hypercontractivity 

Let be an arbitrary finite set endowed with the measure /i. Assume that [i has at least 2 
atoms with non-zero measure. Let & be a linear subspace of the space of all functions from Q 
to R, that includes the constant functions. 

Define the operator L to be the orthogonal projection to functions with zero mean L = 
Id — E^. Define the semigroup T t = e~ tL (t > 0). We shall use the explicit formula 

T t = E ll + e-'L = (1 - e- l )E^ + e^Id 

Definition 8.10: The semigroup (Tt) t> is (p, c/)-hypercontractive (for 1 < q < p < oo) over S 
if there exists to such that for all t > to and / G L q {p) n & 

\\ T tf\\L P (fi) < II/IIl,( M ) 

If such a to exists and is the least possible, then a Ptq (fj,, 6) = e _i ° is called the (p, q)-hypercontractive 
constant for the measure \x over S. 
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Multivariate functions are functions from £l n to R. The operators L and Tt have multivariate 
parallels. Define 



V = Ld- E Xi M , Tl = e~ tL \ T t = \{Ti 



From supermultiplicativity of norms, it is easy to observe that for a linear space of func- 
tions (3 as above, a p ^ q {n,&) < o~ Py q(p, 6 n ). However, when q < p one can apply Minkowski's 
inequality to get the inequality in the opposite direction, and deduce a p „(p,&) = a Ptq (p,& n ). 
In our case, & is the space of functions from § m to R whose Fourier transform is supported on 
p° and p 1 , 

We now wish to consider the degree of multivariate functions. A monomial of degree d is a 
function / such that there exists a set of indices S C [n] of size d such that for i £ S, L l f = f 
and for i <£ S, Uf = 0. 

Definition 8.11: A function of degree d is a function in the linear span of all monomials of 
degree < d and not in the span of monomials of degree < d — 1. 

We will use the following corollary of the hypercontractivity for functions of degree d: 

Claim 8.12: If <J p , 2 (p, (S)) = e"* , then for / G 6 of degree d, 

Proof: Write / = T t T~ x f '. Therefore, 

Wfhpifi) < \\Tf f\\L 2 {fi) 

Now write f = fo + fi + ■■■ + fd, where fi is the projection of / onto the monomials of degree 
d. Then Tf f = X^=i e *° */«• Because the monomials are orthogonal to each other, we have 



i^rViiI = nE eto '^ni = Eii eio '^ii2 < e24o ' d E 

i=l i=l i=l 



A||2 _ e 2-t -d 



□ 

In the next subsubsection, we will calculate a for our needs and arrive at the following lemma: 

Corollary 8.13: Let m > e 4 . Let r be a function r : §^ — >• R, of degree 2, all of whose Fourier 
coefficients are supported on tensors of p° and p 1 . Then we have 

llrll 4 < r^llr-ll 4 

IrlU ^ ^ \\ r \\2 



forC= ^ 2 

J m. 
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8.1.2 Calculating a 

In this section, we shall calculate a = a Ptq (p, <5 m ), where & m is the space of functions from § m 
to R, whose Fourier transform is supported on p° and p 1 (and p is the uniform measure). Our 
main result is the following. 

Theorem 8.14: There exists some uiq such that for all m > m,Q, 

0"4,2(M, ©m) > rrT2 



In the following claim, we present the functions in & m in a form that will help us in the 
analysis: 

Claim 8.15: For / G 6 m , /(x) = tr(AP(x)), where A is the matrix U (/(0) © (m - 1)/*(1)) £7* 
(See the definition of U in (JT])). 



Proof: Since / 6 (3, we have /(x) = /(0)p°(x) + (m — l)tr ( /*(l)/9 1 (x) I . This equals to 
fix) = 7(0)p°(x) + (m - l)tr (/*(V(z)) = £r ((/(0) © (m - 1)/*(1)) • (p°(x) p x (x))) 
ir (V (J(0) © (m - !)/*(!)) t/ _1 ■ 17 (p°(x) © p\x)) CT 1 ) = tr{A ■ P(x)) 



Note that by the definition of U, we have 1A = (Al*)* = 1/(0). 
We define the following moment-like operators on A: 



a 



M 1 (A) = ^A i3 M 2 {A) = Y J A% M 3 (A) = J2Al j M 4 (A) = ^Ai 



I 1 . 

ij 

M i,3 i,3 i,3 



MM) = £ ^4-j M C (A) = £ ^4) 2 

ij \ k / 

We can express the 2'nd and 4'th norms of functions in & m using these operators, as shown 
in the following lemmas: 

Lemma 8.16: Let f be a function in & m and let f(x) = tr (AP(x)) as in claim [KT5\ then 
|/|i = E x f(x) 2 = -±- ({m - 2)M 2 (A) + AI{[A) 



m — 1 \ ' ^- 2 
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Lemma 8.17: Let f be a function in & m and let f(x) = tr (AP(x)) as in claim [KT5[ For 
some t > 0, denote a = e . Then 

\T t f\i = E x f(x) 4 = (m _ 1)(m i 2)(m _ 3) ( (^ + O (£)) Mf(A) 

- 0(a 4 )M 1 (A)M 3 (A) 

o(^)Ml(A)M 2 {A) 

+ 0(a 4 m)M 2 (A) 

+ o- 4 (m 2 + m) M 4 (A) 

- O (a 4 m) M r (A) 

- O (a 4 m) M C (A) 

+ 0{£)M q {A)) 

The proofs of these lemmas will be given later. 

Next, we shall assume that / is normalized so that that l/l 2 , = 1 and bound the terms 
appearing in the expression for \Tf\\ under this constraint: 

Claim 8.18: Let / be a function in <5 m such that ||/|| 2 = 1, then: 

1. \M X (A)\ < 0(m) 

2. M 2 (A) <m-l 

3. |M 3 A| < O (ml 



13- 

4. M A {A) < 0(m 2 ) 

5. -M r (A) < 0, -M C (A) < 

6. M q (A) < M$(A) < 0(m 2 ) 

Also, the leading coefficient in all of these bounds is 1. 

Combining all of this information yields Theorem 18.141 

Proof: Assigning the bounds from claim 18.181 in the expression from lemma 18.171 we see that 
all the term except for three are o(^). Addressing the remaining terms we have 

_ f||4 s ^Mt(A) + 0{a A m)Ml(A) + a A m 2 M i (A) 1 

\\ 1 tJ 4 — 7 7T7 7W7 ^ rO{ — ) — 

(m — l)[m — 2){m — 6) m 

m 2 (m — 12) + O (a^m 3 ) + a 4 m 4 1 

(m — l)(m — 2)(m — 3) m 

When choosing a = m~z we get 

4 = m 2 (m - 12) + Q (m) + m 2 1_ 

11 tTU (m-l)(m-2)(m-3) { m > 

This expression is asymptotically smaller than 1. 
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□ 

We shall now present the proofs of the lemmas. We begin with a complicated proof of the 
following easy lemma. 

Lemma 8.19: Let f be a function in <5 m and let f(x) = tr (AP(x)) as in claim [KT5\ then 

M 1 (A) 

Ea;/(x) = 

m 

Proof: 

E x f(x) = E x tr (AP(x)) = tr (AE x P(x)) 

Remember that P is a representation and is reducible to a copy of p° and p , so 

E x P{x) = U (E xP °(x) © E xP \x)) U l 

Because of Schur's orthonormality, we have 

E x p°(x)=<p°,l>=l 

E x p 1 (x)=<p\l>=0 
Therefore, E x P{x) = U (1 © 0) iP" = ^. We can now conclude that 

Klf{x) = tr ( A l) = Mi^> 

\ m J m 

U 

We shall use a similar technique to prove lemma [8.161 regarding the 2nd norm of /. 
Proof of lemma !8.16t We have 

f(x) = tr (AP(x)) ■ tr (AP(x)) = tr ((A ® A) (P(x) <g> P(x))) 

and therefore, 

E x f{x) = tr ((A © A) E x (P(x) ® P(x))) 

Denote Q = E x [P(x) <g) P(x)]. We need a closed formula for Q, which we shall obtain by 
diagonalizing it. Notice that P®P is a reducible representation. When taking the expectation 
E x [P(x) fg) P(x)], all the copies of p° will become 1, and the copies of other representations will 
zero out. The rank of Q is the multiplicity of p° in P P, which is: 

< X P °,XP®P >= E x xp®p(x) = E x I ^^PuixjPjjix) 

\ i 3 

Ve,.^( x ) + V EvPutoPtfix) = m ■ - + m(m- 1) • , * -r = 2 
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So Q's only nonzero eigenvalue is 1 with multiplicity 2. Before computing the eigenvectors of 
Q, notice that Q is symmetric, because Q l = E x P t (x)0P t (x) = E x P(x~ 1 )<3P(x~ 1 ) = Q. The 
eigenvectors of Q are of size m 2 . We can guess two eigenvectors of Q. We denote them by u 
and v, and list them by their entries, indexed by i,j £ [m] (Recall that 1 is the all ones vector, 
and lj is its ith entry, which equals 1): 

Zlij — ±i Lj 

It is easy to see that for every x, u and v are eigenvectors of P{x) <g> P{x). Therefore, these 
vectors are also eigenvectors of Q. However, these vectors are not orthogonal. We need to find 
linear combinations of them which are orthonormal. In other words, let E be the 2 x (m 2 ) 
matrix whose rows are u and v. We wish to find a 2 x 2 matrix O of coefficients such that OE 
will be an orthonormal matrix, i.e. OE ■ (OE) 1 = OEE t O t = I. In that case, we will have 
Q = (OEfOE = EtOtOE. Denote C = EE\ then, 

OCO 1 = I =>• Cr 1 = CO 1 => CO l O = I => O l O = C' 1 



i 



Since C = EE l is symmetric and PSD, we may choose O to be O = C 2 and then O is also 
symmetric. It is easy to verify that 



and then 



uu l 


uv l \ 


/ m 2 


rn 


vu f 


vv l J 


\ m 


in 


-1 


1 


( 1 





C = EE l 



o t o = oo = c- . 

(m — \)m \ — 1 m 



Now 

tr ((A ®A)Q) = tr ((A A) E l O l OE) = tr (E (A A) E l OO) = tr (E (A <g> A) E 1 ^ 1 ) 
One can verify that 



t _ ( u{A®A)u t u{A®A)v t \_f M%(A) ±M?(A) 
v(A® A)u l v(A®A)v t J ~ \ ±M 2 (A) m M 2 (A) 



Finally, 

E x f 2 (x) = ^-—tr I I ■'"-'- ■' ' 



1 tr (( M 2 {A) ±M 2 {A)\ ( 1 -1 

(m-l)m \\±M 2 {A) M 2 (A) J \ -1 m 



' /m JlM 2 (A)+M 2 (A) 



m — 1 V m 



We shall now use the same technique to compute E/ (x) 
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Lemma 8.20: Let f be a function in <5 m and let f(x) = tr (AP(x)) as in claim [KT5\ then 



i = E./ 4 (^) 



(m-l)(m-2)(m-3) 






^)) M *M) 



- 0(1)M 1 {A)M 3 (A) 

0{±)M?(A)M 2 (A) 

+ 0{m)M%{A) 

+ (m? + m) M±(A) 

- 0{m)M r (A) 

- 0{m)M c (A) 
+ 0(±)M q (A)) 

Proof: We have 

f\x) = tr {AP{x)f = tr ((A m ) (P m {x))) 

and therefore, 

E x f\x) = tr{{A m )E{P®\x))) 

Denote Q = E^P® (x). We need a closed formula for Q, which we shall obtain by diagonalizing 
it. Notice that P® 4 is a reducible representation. When taking the expectation K x P^ 4: (x), all 
the copies of p° will become 1, and the copies of other representations will zero out. The rank 
of Q is the multiplicity of p° in P® 4 , which is: 

< X p o,Xp®4 >=E x x P ®i( x ) =E X Y^ p ii( x ) P jj( x ) P kk(x)Pu(x) 

\i,j,k,l 

^E x P 4 (x) + 4- J2 ^ x Pi(x)Pn(x) + 6- ]T E x P? i (x)P jj (x)P kk (X)+ 

i iJAj^J i>3>k> distinct 

3 • J2 ^Pii{x)Pfj{x) + J2 ^xPn{x)P 3J {x)P kk {x)P l l{x) = 

hJj^J ijyk,l distinct 

m m(m — 1) m(m — l)(m — 2) 

h 4 -. - + 6 — ) - -+ 

m m{m — 1) m(m — l)(m — 2) 

m(m — 1) m(m — l)(m — 2)(m — 3) 
m(m — 1) m(m — l)(rn — 2)(m — 3) 

So Q's only nonzero eigenvalue is 1 with multiplicity 15. As before, Q is symmetric. Eigen- 
vectors of Q are of size m 4 . We now show 15 eigenvectors of Q, divided into 5 groups. We list 
them by their entries, indexed by i,j,k,l G [m]: 

E 1 = {liljlfcl,} 

P*2 = {^ijlfelZj <5i,fcl.7'ljj ^,/ljlfc; ^j,khh, ^,/lilfc; ^fc^ljlj} 

E3 = {Si,j6k,i, Si,k5j,i, Si,i$j,k} 
E4 = {5ij t kh,8i,j,ilk,Si,k,ilj,8j,k,ili} 

E5 = {$i,j,k,l} 
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It is easy to see that for every x, These vectors are eigenvectors of P® 4 (x). Therefore, these 
vectors are also eigenvectors of Q. However, these vectors are not orthogonal. As before, let 
E be the 15 x (m?) matrix whose rows are these vectors. Let C = EE l . Then there exists a 
symmetric 15 x 15 matrix O such that OE is orthonormal, and 00 = C _1 . In appendix El we 
give the expression for C and E (A® 4 ) E l . We calculated C~ 1 using a mathematical software 
called Sage. The expression is too large to show it here. Using this result, we computed 
E x / 4 (x) = tr (E (A m ) E l C- 1 ) and got the result. 

□ 

We now wish to apply the result of lemma [8.201 on Ttf to obtain lemma [8. 171 We shall use the 
following two claims: 

Claim 8.21: Let / be a function in & m and let f(x) = tr (AP(x)) as in claim [8T5l then 
T t f(x) = tr (A'P(x)), where A' = aA+(l- a)^^J, and a = e~ l 

Proof: 

tr (A'P(x)) =tr( (a A + (1 - a)^^-j\ P(x)\ = 

tr (aAP(x)) + tr ( (1 - a)^^JP(x)) = af(x) + (1 - a)Ef = T t f(x) 
where we have used the fact that tr(JP(x)) = m and M\ (A) = m ■ E/ 



D 



■ M t(A) j , . D , „. _ (l-2i 



Claim 8.22: For A as in claim ELS and A' = aA+(l- aY^^-J, denote by r = ^-^-. We 

have: 

1. Mi(A') =Mi(A) 

2. M 2 (A') = a 2 M 2 (A) + 2arM 2 (A) + q 2 M?(A) 

3. M 3 (A') = a 3 M 3 (A) + 3cf 2 tM 1 (A)M 2 (A) + 3ar 2 Mf + t 3 M?(A) 



2 

m 



4. M 4 (A') = a 4 M 4 (A)+4a 3 TM 1 (A)M 3 (A)+6a 2 T 2 M^(A)M 2 (A)+AaT :i Mf(A)+T 4 Mf(A) 



2 

m 



5. Mr(A') = a A M r (A) + 4cj 3 tM 2 {A)M 2 {A)± + 2a 2 r 2 M 2 {A)M 2 (A)m + 4a 2 r 2 M 4 (A)± + 
Aar 3 Mf(A)m + T 4 M 4 (A)m 3 

6. M C (A') = a 4 M c (A) + Acj 3 tM 2 {A)M 2 {A)^ + 2a 2 r 2 M 2 {A)M 2 {A)m + 4a 2 T 2 M?(A)± + 
4aT 3 M?(A)m + T 4 M?(A)m 3 

7. M q (A') = a 4 M q (A) + 4(t 3 tM£(A)\ + 6a 2 r 2 M 4 (A) + AaT 3 M 4 (A)m 2 + r 4 M 4 (A)m 4 
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Proof: Since all the cases in this lemma are straightforward, we make do with demonstrating 
the proof of item number 4, and do not include the tedious, but simple calculations of the other 
cases. For item 4 we have: 

M 4 (A') = 2^ <WfciJi \ A ) ( : i 1 j 1 k 1 h)(i2i2k2h)^ i2 i 2k212 = 

hjikihi2J2k2h 
Yl S hhkih (° A + tMi(A) J)f iljlklh )( i2j2k2 i 2 ) S i2J2k2 i 2 

iijik 1 l 1 i 2 J2k 2 l2 

The above summation can be expanded into six types of summands: 

• 1 summand of the form 

° A Yl 6 njikih ( A )t 4 ijik 1 h)(i2J2k 2 l 2 ) 6 i2J2k 2 l2 = <? A M A (A) 

hjikihiijikzh 

• 4 summands of the form 

a\M 1 (A) J2 *i*Wi i A ® 3 ® J ) { n h k,W2i2k2l2) <W^ = ^rM 1 (A)M 3 (A) 

iijikihi2J2k2h 

• 4 summands of the form 

a 2 r 2 M 2 (A) £ S iljlklh (A® 2 ® J® 2 ) {iljlklh){i2j2k2l2} S i2j2k2l2 = 
hjikihizjikzh 

a 2 r 2 M 2 (A)M 2 (A) 

• 2 summands of the form 

a 2 r 2 M 2 (A) J2 Sitiikih (A® J® A® J) {iljlklh){i2J2k2 i 2) S i2J2 k 2 i 2 = 

iijik 1 l 1 i 2 J2k2h 

a 2 T 2 M?(A)M 2 (A) 

• 4 summands of the form 

ar 3 Mf(A) £ * ljlfclh (A ® ^ 3 ) (ilJ - lfclJl)(i3j - 3fcaJa) <W^ 2 = 

hjikihi2J2k2l2 

aT 3 Mf(A)M 1 (A) = aT 3 M?(A) 

• 1 summand of the form 

r'Mt(A) J2 ShJikih (J m ) {hhklh){i2j2k2l2) Si2nk2i2 = r*M?(A) m 2 

iljlklhl2J2k2l2 



D 
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Proof of lemma I8.17t This lemma is simply an application of lemma 18.201 to Ttf and an 
assignment of the values of the moments of A' given in claim 18.221 Since there are many terms 
involved in the assignment, we used Sage to get to the actual expression here as well. 

□ 

Proof of claim [87T8T : 

1. |Afi(A)| < 0(m): 

E/2(x) = t i U (A) + (m _ „ «M) > ' ( (m _ ,)«M) ^ 

m — 1 \ m z / m — 1 \ m z / 

|Mi(A)| <0(m) 

2. M 2 {A) < m-V. 

Kf\x) = 1 = -J— f M 2 (A) + (m - 1)^^) > -J— (M 2 (A)) => 
m — 1 \ m z / m — 1 

M 2 (A) < m - 1 

3. |M 3 A| < O (ml): Let a = E^-yl^, B = A- aJ, b 2 = EBfj. We take 6 to be positive. 
The items above imply that \a\ < O(^) and that 6 < -y/^- 

5Z A t> = 2Z^ a + Bi ^ = m2fl3 + 3aM z( B ) + 3a 2 Mi (5) + Af 3 (S) = 

mV + 3am 2 b 2 + M 3 (B) 

Given that the 2nd moment of B is m 2 b 2 , Jensen's inequality implies that |M3(.B)| < m 3 b s . 
Applying the bounds on a and b yields the result. 

4. M 4 (A) < 0(m 2 ): Let a,B and b be as in the previous item. 

2 A U = Z]( a + B v) 4 = m2fl4 + 4aM 3(^) + 6a 2 M 2 (£) + 4a 3 M x (B) + M 4 (B) = 

i,j hi 

m 2 a A + AaM 3 (B) + 6a 2 m 2 b 2 + M 4 (B) 

Again, given that the 3rd moment of B is m 2 b 2 , Jensen's inequality implies that |M3(S)| < 
m 3 b s and |M4(S)| < m 4 fo 4 . Applying the bounds on a and b yields the result. 

5. — M r {A) < 0, —M C {A) < 0: By definition, these operators are nonnegative. 

6. Mg(A) < M%(A) < 0(m 2 ): Notice that A A 1 is a symmetric PSD matrix. Let A be the 
vector of eigenvalues of AA (which are all nonnegative). We have M 2 {A) = tr^AA*) = 
Zi Ai and M q (A) = tr {AA'AA 1 ) = £. X 2 . Therefore, M q = £. A 2 < (£, ^if = ^ 2 

□ 
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9 Strategy Proofness 

Before discussing further work we wish to briefly discuss the notion of IR, that arises naturally 
when considering the spectrum of outcomes between SCF's and SWF's. The connection between 
IR and IIA is easy to understand, despite the fact that neither of these constraints implies the 
other. In this section we wish to touch upon the other end of the spectrum and discuss the 
connection between IR and strategy proofness. 

We present a definition of strategy proofness that is closely related to IR. We will show 
how our robust impossibility theorem for IR implies a robust impossibility theorem for this 
definition of strategy proofness. In this sense, our technique provides a single proof for the 
analogues of both Arrow's theorem (SWF's with independence) and Gibbard-Satterthwaite's 
theorem (SCF with strategy proofness). 

The definition we give here is based on the definition of Dietrich and List |DL07j for strategy 
proofness on judgment aggregation, which is a general framework that includes the setting of 
Arrow's theorem. In this framework, there is a permissible opinion space X C [k] m and we 
are interested in social aggregators of the form / : X n — > X. A widely studied topic was the 
characterization of spaces X for which independence implies dictatorship, as in Arrow's theorem 
and our setting. This characterization can be found in |NP10j and |DH10aj for k = 2 and in 
[DHlQb] for general k. In [DL07]. a general definition of strategy proofness was given for this 
framework was for k = 2, and was connected to independence. The definition we present here 
follows from an extension of the definition in [D L07] for any k, which also deals with the case 
where the output space of the function is different than the input space. 

Let H be a fixing subgroup of S m . Let / be / : S^ — > S m /H. We assume that for any 
alternative j £ [m], given the ranking of j by a certain voter, we are able to rank the voter's 
preference over all possible j-profiles of the outcome. 

Formally, define the set Qj to be the set of all possible j-profiles of a coset of H, Qj = 
{^ _1 (i)}_R-g§ m /h- F° r every alternative j £ [m] and ranking r £ [m], we assume that there 
exists some full transitive order relation on Qj. We denote it by < r ,j- It is natural to assume 
that for a specific ranking of the alternatives x, the top of the order < x - 1 (j),j wm be the j-profile 
of the coset that includes x. However, we do not demand that. 

A manipulation is a situation where for any alternative j, a voter i S [n] can report a false 
opinion and get better -/-profile of the outcome, according to his preference order < _i,... .. 
Formally, a manipulation exists when the following holds: 

^3^,x~\xi,yij~ x (x~ % ,Xi)(j) < -i • /^(aT^iXj) 

'i >J 

The manipulation power of a voter i is the rate of manipulations he can make: 

3 

The total manipulation power of / is M(f) = Yli-^iif)- f wm be called strategy-proof if 
M(f) = 0. The rationale behind this definition is that as the designers of a social aggregation 
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mechanism, we do not wish to specify up front for which alternatives j there might be a voter i 
who wishes to manipulate, and therefore we wish to be immune against all such manipulations. 



njfi -"if, 2 



We shall now connect IR(f) to M{f). Denote c = max KlK2& g m / H — ^ — a (Recall tiR- 

is the characteristic vector of a multiset K). For example, when H is the group of permutations 
fixing {1}, as in SCFs, we have c = ||(1, 0, ...0) - (0, ^, ..., ^ T )||| = ^. 

Claim 9.1: cM(f) > IR(f) 

Proof: Examine all pairs of profiles (x~ l ,Xi) and (x~ l ,yi), such that xj (j) = y^ (j) and 
f{x~ t ,Xi)^ l {j) ^ f \x~ l , yi) -1 (j) ■ By definition, those pairs of inputs contribute at most c to 

iR(f). 

Denote r = x^ l {j) = y^(j), K x = f {x~\ x,{)~ 1 {j) , K y = f (x~\ y^ 1 (j) . Since K x / K y , 
we must have either K x < r j K y or K y < r j K x . Therefore, this pair of inputs contribute 1 to 
M(f). 

□ 



As a corollary of this claim and theorem 13.10} we have 

Corollary 9.2: Let H C § m be a fixing subgroup o/S m . For m > 3, an H-social aggregator 
f for which M(f) < e is C>H{poly{m)e) close to a function that is either a constant function 
or dictatorial of the following form: there exists a voter i and a constant permutation y of the 
rankings such that f(x) = y o Xj. 



10 Further Work 

We have began to apply the techniques of this paper to Arrow's theorem (with relaxed indepen- 
dence constraint) and to Gibbard-Satterthwaite's theorem, with partial success. GS seems to 
offer many more new challenges for this scheme, as the Laplacian is not PSD and not Symmetric. 

It is also interesting to see how this technique applies to the generalized problem of judgment 
aggregation (see its definition in section [9]) . There are known combinatorial characterization 
results of functions satisfying independence in this setting, depending on the opinion space X, 
and our technique may be useful in finding robust versions of these theorems, wherever possible. 

Another direction is to study how our proof could be modified for groups other than S m . 

As mentioned earlier, our result can be interpreted as a 2 query dictatorship tester, for 
functions SJ^ — > S m /H. It is interesting to see whether this has any computational implications. 
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A Appendix: matrices for the proof of lemma 18.201 
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We compute C _1 using an algebraic software called .Sage. The expressions are too large to 
write here. Since all the entries of C are monomials in to, the determinant of C is a polynomial 
of a bounded degree in to, and therefore C is regular for all but a finite number of values of to. 
Actually, using Sage, we find the determinant of C to be m 15 (m — 1) 14 (to — 2) 7 (m — 3), so C 
is singular only for to = 0, 1, 2, 3. 

For the expression for .E (^4® 4 ) 22 we abuse the notation and use the sets 22j as matrices 
whose rows are the elements of 2*L. We have: 



£ 



£ 



^(A® 4 )^ £!(A® 4 )£; 2 ... 2?! (A® 4 ) 22 
22 5 (A® 4 )22* 22 5 (^ 4 )22| ... 22 5 (A® 4 ) E, 
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E 4 (A®*) E\ = \ 



/ MiM 3 ^ M?M 2 M\M 2 M?M 2 \ 

MlM 2 MiM 3 ^ M\M 2 M\M 2 

M\M 2 M\M 2 MiAf 3 i M\M 2 

\ MlM 2 MlM 2 M\M 2 MxMz^ ) 



E A {A 



' b m 



( M ± M 3 \ 



MiM 3 

M1M3 

V M ± M 3 J 

E 5 (A® 4 ) E\ = {M C M c M c ) 

E,{A^)E\ = {Mi) 

For the blocks that weren't explicitly mentioned, E { (A® 4 ) Ej = (Ei (A m ) E] 
We add here the protocol of the Sage code we used, for completeness: 

R . <m , al , al3 , al 12 , a22 , a4 , r22 , c22 , q , sig>=QQ [] 

C=matrix (R , 15,15, 

[ 
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# the variables al,al3,all2,a22,a4,r22,c22,q represent , repectively 

Mf(A),M l (A)M 3 (A),Mf(A)M 2 (A),M^(A),M 4 (A),M r (A),M c (A),M q (A) 

# the variables a,b,c,d,e,f,g,h represent, repectively 

Mt(A'),M l (A')M^A'),Ml(A')M 2 (A'),M^(A'),M 4 (A'),M r {A'),M c {A'),M q {A') 

a=al 

b=sig~3*al3 + 3*sig*(l-sig)~2*al/nT4 + 3*sig~2*(l-sig)*all2/m~2 + t~3*al*nT2 

c=sig~2*all2 + (2*sig*t+t~2)*al 

d=sig~4*a22 + 2*(2*sig*t+t~2)*sig~2*all2 + (2*sig*t+t~2) ~2*al 

e=sig~4*a4 + 4*sig*t~3*al + 6*sig~2*t~2*all2 + 4*sig~3*t*al3 + t~4*al*m~2 

f=sig~4*r22 + 4*sig~3*t*all2/m + sig~2*t~2*(2*all2*m + 4*al/m) + 4*sig*t~3*al*m 
+ t~4*al*m~3 

g=sig~4*c22 + 4*sig~3*t*all2/m + sig~2*t~2*(2*all2*m + 4*al/m) + 4*sig*t~3*al*m 
+ t~4*al*m~3 

h=sig~4*q + 4*sig~3*t*al/m~2 + 6*sig~2*t~2*al + 4*sig*t~3*al*m~2 + t~4*al*m~4 

#Sage can not work with matrices with rational entries. Therefore, we multiply 
all the variables by m~10 and the matrix E by m~3. 

a=a*nT10; b=b*m~10; c=c*m~10; d=d*nT10; e=e*nT10; f=f*m~10; g=g*nT10; h=h*m~10; 

E=matrix (R , 15,15, 

[ 



a*m~3 
a*m~l, a 

a*m"2 

a*m~2 

a*m~2 

a*m~2 

a*m~2 

a*m~2 

a*m~l 

a*m~l 

a*m"l 

a*m"l 

a*m~l 



a*m~2 , a*m~2 , a*m~2 , a*m~2 , a*m~2 , a*m~2 , a*m~ 1 , a*m~ 1 , a*m~ 1 , a*m~ 1 , a*m~ 1 , a*m~ 1 , 

c,a*m,a*m,a*m,a*m,a*m, c*m~2,a,a, c*m~2,c*m~2,a,a, c*m, 
a*m,c,a*m,a*m,a*m,a*m, a,c*nT2,a, c*nT2,a,c*nT2,a, c*m, 
a*m,a*m,c,a*m,a*m,a*m, a,a,c*m~2, a,c*nT2,c*nT2,a, c*m, 
a*m,a*m,a*m,c,a*m,a*m, a,a,c*m~2, c*nT2,a,a,c*nT2, c*m, 
a*m,a*m,a*m,a*m,c,a*m, a,c*nT2,a, a,c*nT2,a,c*nT2, c*m, 
a*m,a*m,a*m,a*m,a*m,c, c*nT2,a,a, a,a,c*nT2,c*nT2, c*m, 
c*m~2,a,a,a,a,c*m~2, d*m~3,h*m~3,h*m~3, c*m,c*m,c*m,c*m, f*m~3, 
a,c*m~2,a,a,c*m~2,a, h*m~3,d*m~3,h*m~3, c*m,c*m,c*m,c*m, f*m~3, 
a,a,c*m~2,c*m~2,a,a, h*m~3,h*m~3,d*m~3, c*m,c*m,c*m,c*m, f*m~3, 
c*m~2,c*m~2,a,c*m~2,a,a, c*m,c*m,c*m, b,c*m,c*m,c*m, b*m~2, 
c*m"2,a,c*m~2,a,c*m~2,a, c*m,c*m,c*m, c*m,b,c*m,c*m, b*m~2, 



45 



a*m"l, a,c*m~2,c*m~2,a,a,c*m~2, c*m,c*m,c*m, c*m,c*m,b,c*m, b*m~2, 

a*m~l, a,a,a,c*m~2,c*m~2,c*m~2, c*m,c*m,c*m, c*m,c*m,c*m,b, b*m~2, 

a, c*m,c*m,c*m,c*m,c*m,c*m, g*m~3,g*m~3,g*ni~3, b*m~2,b*m~2,b*m~2,b*m~2, e*m~3 

]) 

1=(C. inverse ()*E) . trace ()/m~ 13 

Km, 1 ,0,0,0,0,0,0,0, sig) 

l(m,0,l,0,0,0,0,0,0,sig) 

Km, 0,0, 1,0, 0,0, 0,0, sig) 

Km, 0,0, 0,1, 0,0, 0,0, sig) 

Km, 0,0, 0,0, 1,0, 0,0, sig) 

Km, 0,0, 0,0, 0,1, 0,0, sig) 

Km, 0,0, 0,0, 0,0, 1,0, sig) 

Km, 0,0, 0,0, 0,0, 0,1, sig) 
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